Abstract. Under K.-T. Sturm's formulation, we obtain a Gaussian upper bound for the
tail probability of the mean value of independent, identically distributed random variables
with values in $\mathbb{R}$-trees and Hadamard manifolds.

1. Introduction and statement of the main result 

The aim of this paper is to study the weak Law of Large numbers for CAT(0)-space-valued
stochastic processes (see Subsection 2.1 for the definition of CAT(0)-spaces).

Let $N$ be a CAT(0)-space and $(\Omega, \Sigma, \mathbb{P})$ a probability space. Given a random variable
$W : \Omega \to N$ such that the push-forward measure $W_*\mathbb{P}$ of $\mathbb{P}$ by $W$ has finite moment
of order 2, we define its expectation $\mathbb{E}_{\mathbb{P}}(W)$ as the barycenter of the measure $W_*\mathbb{P}$ (the
definition of the barycenter is in Subsection 2.1). In [8, Theorem 4.7], K.-T. Sturm
introduced a natural definition of the mean value of $n$ points $y_1, \dots, y_n$ in $N$, called the
inductive mean value and denoted by $\frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, y_i$ (see Definition 2.5 for the precise
definition). For a sequence of independent, identically distributed $N$-valued random variables
$(Y_i)_{i=1}^{\infty}$ on the probability space $\Omega$, he obtained the weak Law of Large numbers by proving
the following inequality:

(1.1)  $\int_{\Omega} d_N\Bigl(\frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, Y_i(\omega), \mathbb{E}_{\mathbb{P}}(Y_1)\Bigr)^2 d\mathbb{P}(\omega) \le \frac{1}{n}\int_{\Omega} d_N\bigl(Y_1(\omega), \mathbb{E}_{\mathbb{P}}(Y_1)\bigr)^2 d\mathbb{P}(\omega).$

He also proved the strong Law of Large numbers ([8, Theorem 4.7, Proposition 6.6]).

Motivated by Sturm's work, using the results of the theory of Levy-Milman concentra- 
tion of 1-Lipschitz maps obtained in |H |5], we obtain the following Gaussian estimate. 

Theorem 1.1. Let $(Y_i)_{i=1}^{\infty}$ be a sequence of independent, identically distributed random
variables on a probability space $(\Omega, \Sigma, \mathbb{P})$ with values in an $\mathbb{R}$-tree $T$. We assume that the
support of the measure $(Y_1)_*\mathbb{P}$ has bounded diameter $D$. Then, for any $r > 0$, we have

$\mathbb{P}\Bigl(\Bigl\{\omega \in \Omega \Bigm| d_T\Bigl(\frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, Y_i(\omega), \mathbb{E}_{\mathbb{P}}(Y_1)\Bigr) \ge r\Bigr\}\Bigr) \le 4e^{1/75} e^{-nr^2/(150D^2)}.$

See Subsection 2.1 for the definition of $\mathbb{R}$-trees.
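To get a feeling for the size of this estimate, the following Python sketch evaluates the right-hand side of Theorem 1.1, read as $4e^{1/75}e^{-nr^2/(150D^2)}$ (the form produced by the proof in Section 3), and inverts it to find the smallest $n$ for which the bound drops below a prescribed level. The names `tail_bound`, `samples_needed` and the parameter `delta` are ours and are introduced only for illustration.

```python
import math

# Constant in front of the Gaussian factor, as read from Theorem 1.1.
PREFACTOR = 4.0 * math.exp(1.0 / 75.0)


def tail_bound(n: int, r: float, diameter: float) -> float:
    """Right-hand side 4 e^{1/75} exp(-n r^2 / (150 D^2)) of Theorem 1.1."""
    return PREFACTOR * math.exp(-n * r * r / (150.0 * diameter * diameter))


def samples_needed(r: float, diameter: float, delta: float) -> int:
    """Smallest n >= 1 such that tail_bound(n, r, diameter) <= delta."""
    if delta >= PREFACTOR:
        return 1
    n = 150.0 * diameter ** 2 * math.log(PREFACTOR / delta) / r ** 2
    return max(1, math.ceil(n))


if __name__ == "__main__":
    n = samples_needed(r=0.5, diameter=1.0, delta=0.01)
    print(n, tail_bound(n, 0.5, 1.0))
```

Note that the bound is only informative once $n r^2$ is large compared with $150 D^2$; for small $n$ the right-hand side exceeds 1.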

In the case where $N$ is a Hadamard manifold, we also obtain the following. For any
$m \in \mathbb{N}$, we put

, /If„(m+l)/(4m-2)„7r2 _ 

:= ei/(2-)|i + ^ _ ^1 and := e^/(^"^Hl + v^e^'^+^^Z^^'^-^)}. 

Note that both $A_m$ and $\tilde{A}_m$ are bounded from above by a universal constant $C > 0$.

Theorem 1.2. Let $(Y_i)_{i=1}^{\infty}$ be a sequence of independent, identically distributed random
variables on a probability space $(\Omega, \Sigma, \mathbb{P})$ with values in an $m$-dimensional Hadamard
manifold $N$. We assume that the support of the measure $(Y_1)_*\mathbb{P}$ has bounded diameter $D$.
Then, for any $r > 0$, we have

P(^|cj en\ dN(^- Yi{u),Ep{Yi)^ > r|) < mm{Ame-^^,Ame-i^} 

1=1, ■■■ ,n 

There are many other ways to define a mean value of points in a CAT(0)-space (see
Remark 2.6). For example, in [2], A. Es-Sahib and H. Heinich introduced another
notion of mean value and expectation. They obtained the strong Law of Large numbers
under their definition. In this paper, we treat only Sturm's formulation.

Acknowledgements. The author would like to thank Professors Kazuhiro Kuwae and
Daehong Kim for motivating this work. This work was partially supported by Research
Fellowships of the Japan Society for the Promotion of Science for Young Scientists.



2. Preliminaries 

2.1. Basics of CAT(0)-spaces. In this subsection we recall some terminology from the
geometry of CAT(0)-spaces. We refer to [8] for the details of the results on CAT(0)-spaces
mentioned below.

Let $(X, d_X)$ be a metric space. A rectifiable curve $\gamma : [0, 1] \to X$ is called a geodesic if
its arclength coincides with the distance $d_X(\gamma(0), \gamma(1))$ and it has constant speed, i.e.,
it is parameterized proportionally to arclength. We say that a metric space is a geodesic
space if any two points are joined by a geodesic. If any two points are
joined by a unique geodesic, then the space is said to be uniquely geodesic. A complete
geodesic space $X$ is called a CAT(0)-space if we have

$d_X(x, \gamma(1/2))^2 \le \frac{1}{2}\, d_X(x, y)^2 + \frac{1}{2}\, d_X(x, z)^2 - \frac{1}{4}\, d_X(y, z)^2$

for any $x, y, z \in X$ and any geodesic $\gamma : [0, 1] \to X$ from $y$ to $z$. For example, Hadamard
manifolds, Hilbert spaces, and $\mathbb{R}$-trees are all CAT(0)-spaces. An $\mathbb{R}$-tree is a complete
geodesic space such that the image of every simple path is the image of a geodesic.
It follows from the next theorem that CAT(0)-spaces are uniquely geodesic.

Theorem 2.1 (cf. [8, Corollary 2.5]). Let $N$ be a CAT(0)-space and $\gamma, \eta : [0, 1] \to N$ be
two geodesics. Then, for any $t \in [0, 1]$, we have

$d_N(\gamma(t), \eta(t)) \le (1 - t)\, d_N(\gamma(0), \eta(0)) + t\, d_N(\gamma(1), \eta(1)).$

Let $N$ be a CAT(0)-space. We denote by $\mathcal{P}^2(N)$ the set of all Borel probability measures
$\nu$ on $N$ having finite moment of order 2, i.e.,

$\int_N d_N(x, y)^2\, d\nu(y) < +\infty$

for some (hence all) $x \in N$. A point $x_0 \in N$ is called the barycenter of a measure
$\nu \in \mathcal{P}^2(N)$ if $x_0$ is the unique minimizing point of the function

$N \ni x \mapsto \int_N d_N(x, y)^2\, d\nu(y) \in \mathbb{R}.$

We denote the point $x_0$ by $b(\nu)$. It is well-known that every $\nu \in \mathcal{P}^2(N)$ has a barycenter
([8, Proposition 4.3]).
A simple variational argument implies the following lemma.

Lemma 2.2 (cf. [8, Proposition 5.4]). Let $H$ be a Hilbert space. Then, for each
$\nu \in \mathcal{P}^2(H)$, we have

$b(\nu) = \int_H y\, d\nu(y).$

Let $(\Omega, \Sigma, \mathbb{P})$ be a probability space and $N$ a CAT(0)-space. For an $N$-valued random
variable $W : \Omega \to N$ satisfying $W_*\mathbb{P} \in \mathcal{P}^2(N)$, we define its expectation $\mathbb{E}_{\mathbb{P}}(W) \in N$ as
the point $b(W_*\mathbb{P})$. By Lemma 2.2, in the case where $N$ is a Hilbert space, this definition
coincides with the classical one:

$\mathbb{E}_{\mathbb{P}}(W) = \int_{\Omega} W(\omega)\, d\mathbb{P}(\omega).$

The proof of the next lemma is easy, so we omit it. 

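Lemma 2.3. Let $N$ be a CAT(0)-space and $\nu \in \mathcal{P}^2(N)$. Then, we have

$d_N(b(\nu), \operatorname{Supp} \nu) \le \operatorname{diam}(\operatorname{Supp} \nu).$

Theorem 2.4 (Variance inequality, cf. [8, Proposition 4.4]). Let $N$ be a CAT(0)-space
and $\nu \in \mathcal{P}^2(N)$. Then, for any $z \in N$, we have

$\int_N \bigl\{ d_N(z, x)^2 - d_N(b(\nu), x)^2 \bigr\}\, d\nu(x) \ge d_N(z, b(\nu))^2.$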
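In a Hilbert space the variance inequality is in fact an equality, since the barycenter is the ordinary mean (Lemma 2.2). The following Python sketch checks this numerically for an empirical measure in the plane; it is only an illustration in the flat case, and the helper names `sq_dist`, `barycenter` and `variance_gap` are ours.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points of R^d."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def barycenter(points):
    """Barycenter of the uniform measure on `points` in R^d.

    By Lemma 2.2 this is the coordinatewise arithmetic mean.
    """
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def variance_gap(points, z):
    """Left-hand side of the variance inequality minus d(z, b)^2."""
    b = barycenter(points)
    lhs = sum(sq_dist(z, x) - sq_dist(b, x) for x in points) / len(points)
    return lhs - sq_dist(z, b)


random.seed(0)
pts = [(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(200)]
gaps = [
    variance_gap(pts, (random.uniform(-3, 3), random.uniform(-3, 3)))
    for _ in range(50)
]
print(max(abs(g) for g in gaps))  # zero up to rounding in the Euclidean case
```

In a genuinely nonpositively curved space such as an $\mathbb{R}$-tree the gap can be strictly positive, which is the content of the inequality.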

We now explain the inductive mean value introduced by Sturm in [8, Definition 4.6].

Definition 2.5 (Inductive mean value). Given a sequence $(y_i)_{i=1}^{\infty}$ of points in a uniquely
geodesic space $X$, we define a new sequence of points $s_n \in X$, $n \in \mathbb{N}$, by induction as
follows. We define $s_1 := y_1$ and $s_n := \gamma(1/n)$, where $\gamma : [0, 1] \to X$ is the geodesic
from $s_{n-1}$ to $y_n$, that is, $\gamma(0) = s_{n-1}$ and $\gamma(1) = y_n$. We denote the point $s_n$ by
$\frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, y_i$ and call it the inductive mean value of the points $y_1, \dots, y_n$.

Remark 2.6. (1) If the space $X$ is a non-linear metric space, then the point $\frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, y_i$
depends strongly on the order of the points $y_i$, as the following example shows. For $i = 1, 2, 3$,
let $T_i := \{(i, r) \mid r \in [0, +\infty)\}$ be a copy of $[0, +\infty)$ equipped with the usual Euclidean
distance function. The tripod $T$ is the metric space obtained by gluing together all these
spaces $T_i$, $i = 1, 2, 3$, at their origins with the intrinsic distance function. Let $y_1 := (1, 1)$,
$y_2 := (2, 1)$, and $y_3 := (3, 1)$. In both orders considered below, $s_2$ is the common origin, so
$s_3$ lies at distance $1/3$ from the origin on the ray of the last point. Thus the inductive mean
value in the order $y_1, y_2, y_3$ is the point $(3, 1/3)$, whereas the one in the order $y_1, y_3, y_2$ is
the point $(2, 1/3)$.

(2) There are many other ways to define a mean value of points $y_1, \dots, y_n$ in a CAT(0)-space
(see [8, Remark 6.4]). For example, one can define the mean value as the barycenter of the
uniform probability measure $\frac{1}{n}\sum_{i=1}^{n} \delta_{y_i}$ on these points. Observe that this definition does
not depend on the order of the points (and so it differs from the inductive mean value in general).
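The tripod example in Remark 2.6 (1) can be replayed mechanically. The Python sketch below implements Definition 2.5 on the tripod with exact rational arithmetic; a point is encoded as a pair (ray index, distance from the origin), and the function names `dist`, `geodesic_point` and `inductive_mean` are ours. Applying the definition literally, with $s_3 = \gamma(1/3)$, gives the points $(3, 1/3)$ and $(2, 1/3)$ for the two orders.

```python
from fractions import Fraction


def dist(p, q):
    """Intrinsic distance on the tripod; points are (ray index, radius)."""
    (i, a), (j, b) = p, q
    return abs(a - b) if i == j else a + b


def geodesic_point(p, q, t):
    """Point gamma(t) on the constant-speed geodesic from p to q, 0 <= t <= 1."""
    (i, a), (j, b) = p, q
    if i == j:
        return (i, a + t * (b - a))
    travelled = t * (a + b)  # arclength covered starting from p
    if travelled <= a:
        return (i, a - travelled)  # still on the ray of p
    return (j, travelled - a)  # already past the origin, on the ray of q


def inductive_mean(points):
    """Inductive mean value of Definition 2.5: s_1 = y_1, s_n = gamma(1/n)."""
    s = points[0]
    for n, y in enumerate(points[1:], start=2):
        s = geodesic_point(s, y, Fraction(1, n))
    return s


y1, y2, y3 = (1, 1), (2, 1), (3, 1)
print(inductive_mean([y1, y2, y3]))  # lies on ray 3
print(inductive_mean([y1, y3, y2]))  # lies on ray 2
```

The origin has several encodings $(i, 0)$, which is harmless here because `dist` and `geodesic_point` treat all of them consistently.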

2.2. Invariants of mm-spaces and measures. In this subsection we define several
invariants of mm-spaces and measures, which are needed for the proof of the main theorems.

An mm-space $X = (X, d_X, \mu_X)$ is a complete separable metric space $(X, d_X)$ with a
Borel probability measure $\mu_X$. Let $Y$ be a complete metric space and $\nu$ a finite Borel
measure on $Y$ having separable support with total measure $m$. For any $\kappa > 0$, we
define the partial diameter $\operatorname{diam}(\nu, m - \kappa)$ of $\nu$ as the infimum of the diameter of $Y_0$, where
$Y_0$ runs over all Borel subsets of $Y$ such that $\nu(Y_0) \ge m - \kappa$. Let $X$ be an mm-space with
$m_X := \mu_X(X)$ and $Y$ a complete metric space. For any $\kappa > 0$, we define the observable
diameter of $X$ by

$\operatorname{ObsDiam}_Y(X; -\kappa) := \sup\{\operatorname{diam}(f_*(\mu_X), m_X - \kappa) \mid f : X \to Y \text{ is a 1-Lipschitz map}\}.$

The idea of the observable diameter comes from quantum and statistical mechanics,
i.e., we think of $\mu_X$ as a state on a configuration space $X$ and $f$ is interpreted as an
observable.
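For a finitely supported measure on the real line the partial diameter can be computed exactly, because an optimal set may be taken to be the atoms lying in an interval whose endpoints are atoms. The following Python sketch does this with a sliding window; the function name `partial_diameter` and the encoding of the measure as a list of (position, weight) pairs are our own conventions, and the empty set is given diameter 0.

```python
def partial_diameter(atoms, kappa):
    """diam(nu, m - kappa) for nu = sum of w * delta_x over (x, w) in atoms.

    atoms: list of (position, weight) pairs on the real line, weights > 0.
    Returns the least length of an interval with atom endpoints whose mass is
    at least m - kappa, where m is the total mass of nu.
    """
    atoms = sorted(atoms)
    total = sum(w for _, w in atoms)
    target = total - kappa
    if target <= 0:
        return 0.0  # the empty set already carries enough mass
    best = float("inf")
    left = 0
    mass = 0.0
    for pos, w in atoms:
        mass += w
        # Drop atoms on the left while the window still has mass >= target.
        while mass - atoms[left][1] >= target:
            mass -= atoms[left][1]
            left += 1
        if mass >= target:
            best = min(best, pos - atoms[left][0])
    return best


nu = [(0.0, 0.25), (1.0, 0.25), (2.0, 0.25), (10.0, 0.25)]
print(partial_diameter(nu, 0.25))  # the far atom at 10 can be discarded
```

Discarding mass $\kappa$ lets the outlying atom be ignored, which is exactly why partial diameters are insensitive to small-probability events.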

Let $X$ be an mm-space. Given any two positive numbers $\kappa_1$ and $\kappa_2$, we define the
separation distance $\operatorname{Sep}(X; \kappa_1, \kappa_2) = \operatorname{Sep}(\mu_X; \kappa_1, \kappa_2)$ of $X$ as the supremum of the number
$d_X(A_1, A_2)$, where $A_1$ and $A_2$ are Borel subsets of $X$ such that $\mu_X(A_1) \ge \kappa_1$ and
$\mu_X(A_2) \ge \kappa_2$, and we put

$d_X(A_1, A_2) := \inf\{d_X(x_1, x_2) \mid x_1 \in A_1, x_2 \in A_2\}.$

The next two lemmas are easy to prove.

Lemma 2.7 (cf. [6, Section 3½.30]). Let $X$ and $Y$ be two mm-spaces and $f : X \to Y$ be
an $\alpha$-Lipschitz map such that $f_*(\mu_X) = \mu_Y$. Then, for any $\kappa_1, \kappa_2 > 0$, we have

$\operatorname{Sep}(Y; \kappa_1, \kappa_2) \le \alpha \operatorname{Sep}(X; \kappa_1, \kappa_2).$

Lemma 2.8. Given two positive numbers $\kappa_1$ and $\kappa_2$ such that $\kappa_1 > 1/2$ and $\kappa_2 > 1/2$,
we have

$\operatorname{Sep}(\nu; \kappa_1, \kappa_2) = 0.$

Lemma 2.9 (cf. [6, Section 3½.33]). Let $X$ be an mm-space. Then, for any $\kappa, \kappa' > 0$ with
$\kappa > \kappa'$, we have

$\operatorname{ObsDiam}_{\mathbb{R}}(X; -\kappa') \ge \operatorname{Sep}(X; \kappa, \kappa).$
See also [H Lemma 2.5] for the proof of the above lemma. 

Let $N$ be a CAT(0)-space and $\nu \in \mathcal{P}^2(N)$. Given any $\kappa > 0$, we define the central
radius $\operatorname{CRad}(\nu, 1 - \kappa)$ as the infimum of $\rho > 0$ such that $\nu(B_N(b(\nu), \rho)) \ge 1 - \kappa$. Let $X$
be an mm-space and $N$ a CAT(0)-space such that $f_*(\mu_X) \in \mathcal{P}^2(N)$ for any 1-Lipschitz
map $f : X \to N$. For any $\kappa > 0$, we define

$\operatorname{ObsCRad}_N(X; -\kappa) := \sup\{\operatorname{CRad}(f_*(\mu_X), 1 - \kappa) \mid f : X \to N \text{ is a 1-Lipschitz map}\},$

and call it the observable central radius of $X$.

From the definition, we immediately obtain the following lemma.

Lemma 2.10 (cf. [6, Section 3½.31]). For any $\kappa > 0$, we have

$\operatorname{ObsDiam}_{\mathbb{R}}(X; -\kappa) \le 2 \operatorname{ObsCRad}_{\mathbb{R}}(X; -\kappa).$

Observable diameters, separation distances, and observable central radii were introduced by
Gromov in [6, Chapter 3½] to capture the theory of the Lévy-Milman concentration of
1-Lipschitz maps visually.

Given an mm-space $X$, we define the concentration function $\alpha_X : (0, +\infty) \to \mathbb{R}$ of $X$
by letting $\alpha_X(r)$ be the supremum of $\mu_X(X \setminus A_{+r})$, where $A$ runs over all Borel subsets of $X$
such that $\mu_X(A) \ge 1/2$ and $A_{+r}$ is the open $r$-neighborhood of $A$. Concentration functions were
introduced by D. Amir and V. Milman in [1].



3. Proof of the main theorem 
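Lemma 3.1. Let $N$ be a CAT(0)-space. Then, for any $n \in \mathbb{N}$, the map

$s_n : N^n \ni (x_1, x_2, \dots, x_n) \mapsto \frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, x_i \in N$

is $(1/n)$-Lipschitz with respect to the $\ell_1$-distance function on the product space $N^n$.

Proof. We argue by induction on $n$; the case $n = 1$ is trivial since $s_1$ is the identity map.
Assuming that the map $s_{n-1}$ is $1/(n-1)$-Lipschitz, by Theorem 2.1 we have

$d_N\bigl(s_n((x_i)_{i=1}^{n}), s_n((y_i)_{i=1}^{n})\bigr) \le \Bigl(1 - \frac{1}{n}\Bigr)\, d_N\bigl(s_{n-1}((x_i)_{i=1}^{n-1}), s_{n-1}((y_i)_{i=1}^{n-1})\bigr) + \frac{1}{n}\, d_N(x_n, y_n)$

$\le \Bigl(1 - \frac{1}{n}\Bigr) \frac{1}{n-1} \sum_{i=1}^{n-1} d_N(x_i, y_i) + \frac{1}{n}\, d_N(x_n, y_n)$

$= \frac{1}{n} \sum_{i=1}^{n} d_N(x_i, y_i).$

This completes the proof. □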
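Lemma 3.1 can be tested numerically on the tripod of Remark 2.6, which is an $\mathbb{R}$-tree and hence a CAT(0)-space. The Python sketch below draws random tuples of points with rational coordinates and records the largest value of $d_T(s_n(x), s_n(y)) - \frac{1}{n}\sum_i d_T(x_i, y_i)$; by the lemma it should never be positive. The encoding of points as (ray index, radius) and all function names are ours.

```python
import random
from fractions import Fraction


def dist(p, q):
    """Intrinsic distance on the tripod; points are (ray index, radius)."""
    (i, a), (j, b) = p, q
    return abs(a - b) if i == j else a + b


def geodesic_point(p, q, t):
    """Point gamma(t) on the constant-speed geodesic from p to q."""
    (i, a), (j, b) = p, q
    if i == j:
        return (i, a + t * (b - a))
    travelled = t * (a + b)
    return (i, a - travelled) if travelled <= a else (j, travelled - a)


def inductive_mean(points):
    """s_1 = y_1 and s_n = gamma(1/n) on the geodesic from s_{n-1} to y_n."""
    s = points[0]
    for n, y in enumerate(points[1:], start=2):
        s = geodesic_point(s, y, Fraction(1, n))
    return s


def lipschitz_defect(xs, ys):
    """d(s_n(x), s_n(y)) minus the scaled l1-distance (1/n) sum d(x_i, y_i)."""
    n = len(xs)
    lhs = dist(inductive_mean(xs), inductive_mean(ys))
    rhs = sum((dist(x, y) for x, y in zip(xs, ys)), Fraction(0)) / n
    return lhs - rhs


def random_point(rng):
    return (rng.randint(1, 3), Fraction(rng.randint(0, 20), 10))


rng = random.Random(1)
worst = Fraction(-10**6)
for n in range(1, 8):
    for _ in range(200):
        xs = [random_point(rng) for _ in range(n)]
        ys = [random_point(rng) for _ in range(n)]
        worst = max(worst, lipschitz_defect(xs, ys))
print(worst)  # never positive, in line with Lemma 3.1
```

Exact fractions are used so that the comparison with zero is not affected by rounding.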



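To prove Theorem 1.1, we need the following two theorems.

Theorem 3.2 (cf. P, Lemma 5.5]). Let $\nu$ be a Borel probability measure on an $\mathbb{R}$-tree $T$
such that $\nu \in \mathcal{P}^2(T)$. Then, there exists a 1-Lipschitz function $\varphi_\nu : T \to \mathbb{R}$ such that

$\operatorname{CRad}(\nu, 1 - \kappa) \le \operatorname{CRad}((\varphi_\nu)_*(\nu), 1 - \kappa) + \operatorname{Sep}\Bigl(\nu; \frac{\kappa}{3}, \frac{\kappa}{2}\Bigr)$

$\qquad + \operatorname{Sep}\Bigl((\varphi_\nu)_*(\nu); \frac{\kappa}{3}, \frac{\kappa}{2}\Bigr) + \operatorname{Sep}((\varphi_\nu)_*(\nu); 1 - \kappa, 1 - \kappa)$

for any $\kappa > 0$.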

Theorem 3.3 (cf. [7, Corollary 1.17]). Let $X = X_1 \times \cdots \times X_n$ be a product mm-space of
mm-spaces $X_i$ with finite diameter $D_i$, $i = 1, \dots, n$, equipped with the product probability
measure $\mu_X := \mu_{X_1} \otimes \cdots \otimes \mu_{X_n}$ and the $\ell_1$-distance function $d_{\ell_1} := \sum_{i=1}^{n} d_{X_i}$. Then, for
any 1-Lipschitz function $f : X \to \mathbb{R}$ and any $r > 0$, we have

(3.1)  $\mu_X(\{x \in X \mid |f(x) - \mathbb{E}_{\mu_X}(f)| \ge r\}) \le 2e^{-r^2/(2D^2)},$

where $D^2 := \sum_{i=1}^{n} D_i^2$. Moreover, we have

(3.2)  $\alpha_X(r) \le e^{-r^2/(8D^2)}.$

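Proof of Theorem 1.1. Let $s_n : T^n \to T$ be the map which sends every point in $T^n$ to its
inductive mean value. Putting $\nu := (Y_1)_*\mathbb{P}$, we first prove the following.

Claim 3.4. For any $r > 0$, we have

$\nu^{\otimes n}(\{x \in T^n \mid d_T(s_n(x), \mathbb{E}_{\nu^{\otimes n}}(s_n)) \ge r\}) \le 4e^{-nr^2/(75D^2)}.$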



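Proof. Since the metric space $(T, n\, d_T)$ is an $\mathbb{R}$-tree, by virtue of Theorem 3.2 there exists
a 1-Lipschitz function $\varphi_n : (T, n\, d_T) \to \mathbb{R}$ such that

$n \operatorname{CRad}((s_n)_*(\nu^{\otimes n}), 1 - \kappa)$

$\le \operatorname{CRad}((\varphi_n \circ s_n)_*(\nu^{\otimes n}), 1 - \kappa) + n \operatorname{Sep}\Bigl((s_n)_*(\nu^{\otimes n}); \frac{\kappa}{3}, \frac{\kappa}{2}\Bigr)$

$\qquad + \operatorname{Sep}\Bigl((\varphi_n \circ s_n)_*(\nu^{\otimes n}); \frac{\kappa}{3}, \frac{\kappa}{2}\Bigr) + \operatorname{Sep}((\varphi_n \circ s_n)_*(\nu^{\otimes n}); 1 - \kappa, 1 - \kappa)$

for any $\kappa > 0$. By Lemma 3.1, the function $\varphi_n \circ s_n : (T^n, d_{\ell_1}) \to \mathbb{R}$ is 1-Lipschitz.
Combining Lemma 2.7 with Lemmas 2.8, 2.9, and 2.10, for any $\kappa, \kappa' > 0$ such that
$\kappa' < \kappa < 1/2$, we hence have

$n \operatorname{CRad}((s_n)_*(\nu^{\otimes n}), 1 - \kappa) \le \operatorname{CRad}((\varphi_n \circ s_n)_*(\nu^{\otimes n}), 1 - \kappa) + n \operatorname{Sep}\Bigl((s_n)_*(\nu^{\otimes n}); \frac{\kappa}{3}, \frac{\kappa}{2}\Bigr)$

$\qquad + \operatorname{Sep}\Bigl((\varphi_n \circ s_n)_*(\nu^{\otimes n}); \frac{\kappa}{3}, \frac{\kappa}{2}\Bigr)$

$\le \operatorname{ObsCRad}_{\mathbb{R}}((T^n, d_{\ell_1}, \nu^{\otimes n}); -\kappa) + 2 \operatorname{Sep}\Bigl(\nu^{\otimes n}; \frac{\kappa}{3}, \frac{\kappa}{2}\Bigr)$

$\le \operatorname{ObsCRad}_{\mathbb{R}}((T^n, d_{\ell_1}, \nu^{\otimes n}); -\kappa) + 2 \operatorname{ObsDiam}_{\mathbb{R}}((T^n, d_{\ell_1}, \nu^{\otimes n}); -\kappa'/2)$

$\le 5 \operatorname{ObsCRad}_{\mathbb{R}}((T^n, d_{\ell_1}, \nu^{\otimes n}); -\kappa'/2).$

According to the inequality (3.1), we thus get

$n \operatorname{CRad}((s_n)_*(\nu^{\otimes n}), 1 - \kappa) \le 5D\sqrt{2n \log(4/\kappa')}.$

Letting $\kappa' \to \kappa$ yields that

(3.3)  $\operatorname{CRad}((s_n)_*(\nu^{\otimes n}), 1 - \kappa) \le 5D\sqrt{\frac{2}{n} \log\frac{4}{\kappa}}$

for any $\kappa \in (0, 1/2)$. Given $\kappa \ge 1/2$, taking an arbitrary $\kappa' \in (0, 1/2)$, we also estimate

$\operatorname{CRad}((s_n)_*(\nu^{\otimes n}), 1 - \kappa) \le \operatorname{CRad}((s_n)_*(\nu^{\otimes n}), 1 - \kappa')$

$\le 5D\sqrt{\frac{2}{n} \log\frac{4}{\kappa'}}$

$\le \frac{5D\sqrt{\log(4/\kappa')}}{\sqrt{\log 4}} \sqrt{\frac{2}{n} \log\frac{4}{\kappa}}.$

Letting $\kappa' \to 1/2$, we hence get

(3.4)  $\operatorname{CRad}((s_n)_*(\nu^{\otimes n}), 1 - \kappa) \le 5D\sqrt{\frac{3}{n} \log\frac{4}{\kappa}}.$

The above two inequalities (3.3) and (3.4) imply the claim. □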
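Put $a_n := d_T(\mathbb{E}_{\nu^{\otimes n}}(s_n), b(\nu))$. By Sturm's inequality (1.1), we have

$\int_{T^n} d_T(s_n(x), b(\nu))^2\, d\nu^{\otimes n}(x) \le \frac{1}{n} \int_T d_T(x, b(\nu))^2\, d\nu(x).$

Lemma 2.3 together with Theorem 2.4 thus implies that

$a_n^2 \le \int_{T^n} d_T(s_n(x), b(\nu))^2\, d\nu^{\otimes n}(x) \le \frac{1}{n} \int_T d_T(x, b(\nu))^2\, d\nu(x) \le \frac{D^2}{n}.$

For any $r \ge a_n$, by using Claim 3.4 and the elementary inequality $(r - a_n)^2 \ge r^2/2 - a_n^2$, we
therefore obtain

$\mathbb{P}\Bigl(\Bigl\{\omega \in \Omega \Bigm| d_T\Bigl(\frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, Y_i(\omega), \mathbb{E}_{\mathbb{P}}(Y_1)\Bigr) \ge r\Bigr\}\Bigr)$

$= \nu^{\otimes n}(\{x \in T^n \mid d_T(s_n(x), b(\nu)) \ge r\})$

$\le \nu^{\otimes n}(\{x \in T^n \mid d_T(s_n(x), \mathbb{E}_{\nu^{\otimes n}}(s_n)) \ge r - a_n\})$

$\le 4e^{-n(r - a_n)^2/(75D^2)}$

$\le 4e^{n a_n^2/(75D^2)} e^{-nr^2/(150D^2)}$

$\le 4e^{1/75} e^{-nr^2/(150D^2)},$

where the last step uses $n a_n^2 \le D^2$. If $r < a_n$, then $e^{n a_n^2/(150D^2)} e^{-nr^2/(150D^2)} > 1$, and hence we have

$\mathbb{P}\Bigl(\Bigl\{\omega \in \Omega \Bigm| d_T\Bigl(\frac{1}{n}\overrightarrow{\sum}_{i=1,\dots,n}\, Y_i(\omega), \mathbb{E}_{\mathbb{P}}(Y_1)\Bigr) \ge r\Bigr\}\Bigr) \le e^{n a_n^2/(150D^2)} e^{-nr^2/(150D^2)} \le e^{1/150} e^{-nr^2/(150D^2)} \le 4e^{1/75} e^{-nr^2/(150D^2)}.$

Combining these two inequalities completes the proof of the theorem. □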
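The passage from Claim 3.4 to the theorem rests on the elementary inequality $(r - a)^2 \ge r^2/2 - a^2$ (equivalent to $(r - 2a)^2 \ge 0$) together with $n a_n^2 \le D^2$. The following Python sketch checks the resulting comparison of the two bounds on random inputs, assuming the constants 75 and 150 as they appear in the proof; the function names `claim_bound` and `theorem_bound` are ours.

```python
import math
import random


def claim_bound(n, r, a, diameter):
    """4 exp(-n (r - a)^2 / (75 D^2)): Claim 3.4 applied at radius r - a."""
    return 4.0 * math.exp(-n * (r - a) ** 2 / (75.0 * diameter ** 2))


def theorem_bound(n, r, diameter):
    """4 e^{1/75} exp(-n r^2 / (150 D^2)): the bound of Theorem 1.1."""
    return 4.0 * math.exp(1.0 / 75.0) * math.exp(-n * r * r / (150.0 * diameter ** 2))


rng = random.Random(2)
violations = 0
for _ in range(10000):
    diameter = rng.uniform(0.1, 10.0)
    n = rng.randint(1, 1000)
    a = rng.uniform(0.0, diameter / math.sqrt(n))  # so that n a^2 <= D^2
    r = rng.uniform(a, a + 5.0 * diameter)         # the case r >= a
    # Small relative slack to absorb floating-point rounding near equality.
    if claim_bound(n, r, a, diameter) > theorem_bound(n, r, diameter) * (1 + 1e-12):
        violations += 1
print(violations)
```

Equality in the first step occurs at $r = 2a$, so the factor $1/2$ in the exponent cannot be improved by this argument without worsening the constant in front.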

Theorem 1.2 follows from the same proof as that of Theorem 1.1 together with the inequality
(3.2) and the following theorem. We shall consider an mm-space $X$ satisfying

(3.5)  $\alpha_X(r) \le C_X e^{-c_X r^2}$

for some positive constants $c_X, C_X > 0$ and any $r > 0$. For such an mm-space $X$ and
$m \in \mathbb{N}$, we put

A^,x := 1 ^ max{e(-^^)^/^2C^e(-^^)^} 

and 

Theorem 3.5 (cf. |H Theorem 1.1]). Let an mm-space $X$ satisfy (3.5), $N$ be an
$m$-dimensional Hadamard manifold, and $f : X \to N$ a 1-Lipschitz map. Then, for any
$r > 0$, we have

G X I dN{f{x),E,^{f)) > r}) < mm{Ar,^xe-^'^-/^'^^>\A^,xe-^^-/^''^^>'}. 